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Abstract. Asymptotic error distribution for approximation of a stochas- 
tic integral with respect to continuous semimartingale by Riemann sum 
with general stochastic partition is studied. Effective discretization schemes 
of which asymptotic conditional mean-squared error attains a lower 
bound are constructed. Two applications are given; efficient delta hedg- 
ing strategies with transaction costs and effective discretization schemes 
for the Euler-Maruyama approximation are constructed. 

1. Introduction 

The present article studies the asymptotic distribution of a sequence of 
continuous processes Z" = {Z"}t(=iQj) defined as 




for one-dimensional continuous semimartingales X = {Xf, 'Ft}, Y = {Yt, ft] 
and sequences of {;Ft}-stopping times t" = {t"} with 

(2) = t" < < • • • < t'/ < . . lim t" = r a.s., 

where T £ (0, oo] is fixed and we suppose that the intervals t"^^ -t" converge 
to as n —> oo in a sense specified later. Taking into mind the definition of 
stochastic integrals, the asymptotic behavior of Z" is theoretically of inter- 
est. There may be no need to explain its practical importance. This problem 
was studied by Rootzen 12TI in the case that Y is a Brownian motion and 
the asymptotic distribution was specified in the case that t" = ;/n and 

Xs = f{Ys, s) for a smooth function /. Jacod [9J treated a related problem on 
the condition that each interval t^^^ - t" is measurable. Jacod and Prof- 
fer IITOl considered the case X = Y and t" = jln and derived the asymptotic 
error distribution of the Euler-Maruyama scheme for stochastic differen- 
tial equations. Hayashi and Mykland [7] discussed this problem again for 
the case t" = jln in the context of discrete-time hedging error. Geiss and 
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Toivola [61 treated an irregular deterministic discretization scheme. The 
condition t" = jln, or more generally, that each interval z"^^ - t" is 

measurable, played an important role in those preceding studies of central 
limit theorem. On the other hand, Karandikar [12] constructed a discretiza- 
tion scheme t" such that Z" converges to almost surely. Since the almost 
sure convergence is not usually attained by the time-equidistant scheme 
t" = j/n, it means Karandikar 's scheme is more effective in a sense. The 
scheme is defined using passage times of X, so that the measurability condi- 
tion, that each interval t"^^ - z" is T^i" measurable, is not satisfied. Recently, 

Fukasawa HI gave the asymptotic distribution for such a scheme and Fuka- 
sawa |5j extended the results to a more general scheme in the case X — Y. 
The present article extends those limit theorems and constructs effective 
discretization schemes of which asymptotic conditional mean-squared er- 
ror attains a lower bound. In particular, Karandikar 's scheme is shown to 
be superior to the time-equidistant one also in terms of mean-squared error. 
An application to delta hedging with transaction costs is given, which can 
be directly used in practice. It remains for further research to extend the 
results to discontinuous semimartingales. In Section 2, we describe and 
prove main results. Effective discretization schemes are constructed in Sec- 
tion 3. The application to hedging is presented in Section 4. An application 
to the Euler-Maruyama approximation is given in Section 5. 



2. Central Limit Theorem 

2.1. Notation and conditions. Here we give a rigorous formulation and 
describe several conditions on X, Y and z". Let (Q, "F, {!Ft}t>0/ P) be a filtered 
probability space. The filtration {Tt] assumed to satisfy the usual condi- 
tions. We denote by Fi • F2 the Stieltjes integral or the stochastic integral of 
Fi with respect to F2. 

Let us recall the definition of stable convergence. Let E be a complete 
separable metric space and F" be a sequence of E-valued random variables 
defined on {Cl,T,P). 

Definition 2.1. For a sub o-field Q <zf,we say F" converges Q-stably if for all 
@-measurable random variable Fq, the joint distribution (F",Fo) converges in law. 

Our main results stated in the next subsection are stable convergences of 
Z" defined as (dJ with continuous semimartingales X and Y. Notice that 
a stable convergence is stable against, in particular, the usual localization 
procedure as well as the Girsanov-Maruyama transformation. 

Denote by P and Pq the set of the predictable processes and the set of 
locally bounded left-continuous adapted processes respectively. Let T £ 
(0, 00] be fixed. Given a continuous semimartingale M and £ N, put 

= (h £ P; \Ht ■ {M)t < 00 for all t £ [0, T)) . 
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Denote by S the set of the continuous semimartingales (X, Y, M) satisfying 
the following Condition 1221 

Condition 2.2. There exist ip,(p,K e and a locally bounded predictable process 
y eP such that 

X^Xo + ip-{M} + yM, y = Yo + (p • <M) + 

on [0, T), where is a continuous local martingale with 

<M^) = K • <M). 

In addition, M is a continuous local martingale with E[{M}j] < oo. 

The integrability of {M}t is not restrictive in light of the localization 
procedure. In order to describe conditions on t", put 

Gl„ = E[|M,.. ^ -M,«im«], G)„ = E[(M,.. ^ - M,f\r,n] 

for a given continuous local martingale M with E[(M)^] < oo and € N 
with 2 < A; < 12. In addition, put 

(3) N[t"]^ = max{; > 0; < t} 

for a given stopping time t. Denote by 7~(M) the set of the sequences of 
stopping times {t"} satisfying (|2]) and the following Condition 12.31 

Condition 2.3. There exist a sequence e„ with e„ ^ and a,b ePo such that 
and 

GlnfGl„ = 0,{e% G]l/Gl„ = o,{el), 
uniformly in ; = 0, 1, . . . ,N[T"]t/or all t e [0, T). 

Condition |2.3| is slightly stronger than Condition 1 of Fukasawa ||5l; never- 
theless, all examples given by Fukasawa [51 satisfy also this condition. Here, 
e„ serves as a scale of increments of M. Note that G'^^/Gj^ - Op(e^) implies 

G^jJGj^ = Op(e„) by Lemma IBH In usual cases, we have G^^^ = Opie^), 
which in fact holds, for example, if d(M)t/df exists and is bounded and 
if t"^^ - t" is of 0{e\) uniformly. Condition 12.31 is therefore a quite mild 
condition in the context of high-frequency asymptotics. It is often easily 
verified by using the Dambis-Dubins-Schwarz time-change technique for 
martingales when t" is a function of the path of M. Once it reduces to the 
Brownian motion case by the time-change, one can utilize various results 
on Brownian stopping times. See Fukasawa [51 for examples. In light of the 
Skorokhod stopping problem, the distribution of an increment can be any 
centered distribution with a suitable moment condition. The left continuity 
of a^ and b corresponds to a local homogeneity property of the distributions 
of increments. It should be noted that sup ■ |t"^^ A i - t" A f| — > does not 
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follow from Condition l2.3l nor needed for our main results. See Lemma|A3] 
for what follows instead. 

Denote by T \{M), T i{M) the subsets of T{M) satisfying the following 
Condition 12.41 and Condition 12.51 respectively. 

Condition 2.4. In addition to Condition \2.3\ there exists C, ePo such that 

r^€!Po, e„G)„/Gj„ = C.;+Op(l) 
uniformly in / = 0, 1, . . . ,'N[x^\tfor all t e [0, T). 

Condition 2.5. In addition to Condition \2.3[ there exists q ^Po such that 
q-^ ePo, G^^^ql„el+Op{el) 

uniformly in ; = 0, 1, . . . ,'N[x"\ifor all t e [0, T). 
Finally, for t" with Q and t € [0, T), put 

(4) [Mf.=mr''M-mr"At. 

A" ;+l ; 

2.2. Main results. Here we state general results on the asymptotic distri- 
bution of Z". The proofs are deferred to Section [23l 

Theorem 2.6. Let (X, XM) e S, t" £ T{M) and Z" be defined as ©• Assume 
one of the following two conditions to hold: 

• M is the local martingale part ofX, that is, 

(5) 7^1. 

• for all t e [0, T), 



(6) 



7=0 



for ke {1,2,3, 4, 5}, where [Mf. ^ is defined as ©. 
Then, Z"/e„ converges T -stably to the law of 

(7) 1(^,^).Y+ J_(cy).Y' 
as a C[0, T)-valued sequence, where 

(8) c2 = «2 _ 2j,2^ y ^ ^^^^ 

and W is a standard Brownian motion which is independent to T . 

Note that the asymptotic distribution (O is an ^^-conditionally Gaussian 
process, so that the marginal law is a mixed normal distribution. The 
following theorems give lower bounds for the conditional variance of the 
mixed normal distribution. 
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Theorem 2.7. Let (X, XM) e S, t" e Ti(M) and Z" be defined as Q. Let uePo 
and put 

CO 

;=0 

Then, it holds 

(9) enW; ^ Ut := {\u\Q ■ {M)t 

in probability for all t e [0, T). Moreover, //©or ([6]) holds, then U^Z" converges 
T-stably to the law ofUZ as a D[0, T)-valued sequence, where Z is defined as 0. 
The asymptotic conditional variance Vt of U^Zf with t e [0, T) satisfies 

(10) Vt = ^{cyf ■ {Y)t\{\u\Q ■ {Mhf > g \{\uy\^'\'/') ■ {M)tf a.s.. 

Theorem 2.8. Let {X, XM) e S, t" £ T2{M) and Z" be defined as ©. Then, it 
holds that 

(11) e^N[T"]t ^ Nt := q-^ ■ (Mh 

in probability for all t e [0, T). Moreover, if ^ or ^ holds, then ^JN[T"]Z^ 
converges f -stably to the law of y/NZ as a D[0, T)-valued sequence, where Z 
is defined as 0. The asymptotic conditional variance Vt of ^JN[%"\tZ1 with 
t € [0, T) satisfies 

(12) Vt = -^{cyf ■ {Y)t q-^ ■ {M)t > \ ((|y| V^) • {M)tf a.s.. 

Note that the right hand sides of JTOl ), ((12|) do not depend on the discretiza- 
tion scheme t". In Section |3l we construct schemes t" £ 7"i(M) n 72 (M) 
which attain the lower bounds ([lOl ), ([12|) respectively. Its practical meaning 
is discussed in Sections 4 and 5. 

The condition ^ will be easily verified especially if M is a Brownian 
motion. Condition |2.3| is then also easily verified if, in addition, t" satisfies 
the condition that each interval t" ^ - t" is Tt-h measurable. It is therefore 

not difficult to recover the preceding results from Theorem |2.6[ An irregular 
scheme treated in Geiss and Toivola IH is an example. 

2.3. Proof for Theorems. Here we give proofs for main results stated in 
the previous subsection. 

Lemma 2.9. Let M be a continuous local martingale with £[(M)^] < oo and 
t" € 7~(M). Let H, y be a locally bounded cag adapted process and a locally 
bounded predictable process respectively. Put M = y - M and define H", M" as 

(13) = H^n, Ml = M^n, for j > with s e [t^, t^^^). 

Assume one of the following two conditions: 

• y = 1, 

• ©holds. 
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Then, it holds 
(14) 



e-\iM-M"m ■ {Mh ^ -ibHy) ■ {M)t, 



€-^{{M-M"fH") ■ {Mh ^ g(fl"H/) • <M)f 



uniformly in t on compact sets of [0, T) in probability. Moreover, 
(15) 

for all t £ [0, r). 



e-4((M-M")4H")-(M)f = Op(l) 



Proof. By the usual localization argument, we can assume H, y, a, b, 
M and (M) are bounded without loss of generality. Let us suppose y = 1. 
Then, for any / € 

e-\{M - M")'H") ■ {Mh 

^e-jyH,n\ai{M,n.t-M,nJ+^ + ^j (Ms-M,-.)'+MMs 

^ ' I ' -'-"^^ ' ] 

by Ito's formula, where a/, j6; are constants only depending on / and, in 
particular, ai = 1/3, a2 - 1/6. Since 

N[t"], 

for / £ 1, 2, 4 by Condition |2]3] and Lemma lA!4l we have 



/-t" , At 



{Ms-M,uf+^dMs 



as well as 



N[t«], 



7=0 7=0 

by Lemma [A.2[ The result then follows from Lemma [A. 4[ 

Next, let us suppose Q and y ^ 1. Note that for all 6 > 0, there exists a 
boimded cag process y^ such that 



E[\y - yY • {Mh] < 6 
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for k = A,6, 12 by Lemma IA31 Notice that for p,q > 1 with 1/p + - 1, 



Jt"m Jt'V 



{ju - Yi)dMu 



7=0 ^^i 



d{M)s 



< E 



sup 



y=0 selT'jAt,T';^^At] 



JT«At 



(y„-7^)dM„ 



7'" 



1^ Jt'fAt 



iyu-ytfd{M)u 

./=o 

by the Holder and the Burkholder-Davis-Gundy inequality. Furthermore, 



pl/2- 



lip 



7=0 



llq 



< E 



< E 



E " \yu-yi?^'d{M)u\[Mtf-^ 

JT'.'Af 



7=0 , 
4 



1/2 



r l7u-7^pP^d<M)„ 
Jo 



1/2 



/=0 



1/2 



For / = 1, we take p — 2 and for / = 2, 4 we take /? = 3/2 to have 



l/(2p) 



.7=0 



1/g 



= 0{ei) 



by the assumption Since 6 can be arbitrarily small, this estimate ensures 
that we can replace M and y by y*^ • M and respectively in ([14]) and ([l5l) . 
Put 

= 7^„forse[T;,T;^^). 



By the same argument, we can estimate 



^ /--T.^jAr /->s 

nT (7'u-7u'")dM. 

Jr'.'At JT«At 

/-U 7 J 



d{M)s 



to ensure that we can replace (M - M") with y^'"(M - M") in (O and flSl ), 
where M" is defined as 

= M^.., for 7 > with s 6 [tJ, tJ^^). 
Then, repeat the proof for the case y = 1 with H replaced with Hy^. //// 
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Proof of Theorem jZl Put M = y • M, A = j/^ • <M) and define A", M" as ^ 
with H — A. Then we have 

Z" = (A - A") • y + ((M - M")<p) • (M) + (M - M") ■ M^. 

We shall prove that 

e-\A - A") -Yt^O, e-H{M - M")(p) ■ (Mh ^ ^(&<p) • <M)t 
in probability uniformly in f on compact sets of [0, T) and that 

(16) D" -.^ e-HM - M") ■ 
converges !F-stably to 

J(&y)-M^ + ^(cy)-y'. 

Step a) Let us show 

(17) e-\A-A'')-Y-o^O 

uniformly in w £ [0, f] in probability. Fix 61,62 > arbitrarily and take a 
bounded cag process ip^ such that 

P[\ip - ip'\^ ■ {M)t > 61] < 62 



by Lemma |A3l Observe that for any v e [0, t], 



(18) =^^?'E I i/'f,d<M)„(p,d<M), 

Jt"av Jt"av 



;_n -Jv Av •Jv.AV 

/^t"^, Ad /^s 
;_n >^t"Ad Jt"Av 



and that 



n'E |l/^fjd<M)„|(p,|d(M), 

r-^ Jx'.'Af Ji'.'Af 



°° n-z" J At 

<CVE[M];.„ \(Pu\d{M)u 



00 I n't" i^t I 

A 1/2 

X^i[M];jn i<p.i'd<M)„ 



<cNe-2 



n 

;=0 
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for a constant C^. Using Lemmas lA.2[|A.4] and the Burkholder-Davis-Gundy 
inequality, we have 

CO 

since 

N[t"]( N[t"], 

e-n' L G% 0, i: G- ^ 

7=0 ;=0 

in probabiHty by Condition l2.3[ Hence the first term of the right hand side 
of ((18)| converges to uniformly in y € [0, t] in probability. For the second 
term, we have 

^n'E - l/^fjd<M)J<p,|d<M), 

JT"At JT"At 



<e-iy - i/.fjd<M)„ |<Ps|d<M), 

I 00 pT" jAf 

< VI'/' - '/''I' • <^>f^"' E a/[^ \(Ps\d{M)s 



1/2 



;=0 I ^^JAf 



1/2 



< 



^|,/;-i/.^'p.<M)J£e-2|[M];j2 . |(p/d<M), 



Using again Lemmas lA.2[|A.4l and the Burkholder-Davis-Gundy inequality, 
we have 

00 

Since 61, 62 can be arbitrarily small, the left hand sum of ([Tsl l converges to 
uniformly in w £ [0, t] in probability. Next, observe that 

{{A - A") • M^) = ({A - A" fx) ■ (M), 



so that it suffices for ((17)) to prove 



e-\{A-A"fK)-{M)t^O 
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in probability, in light of the Lenglart inequality. This follows from 

, 2 



^ /->T jAr /->s I 
^"'E 1 ^udmu\ Ksd{M)s 

<e;;'E \4'u\^d{M}u\[Mf.\ ' Ksd{M)s 

■~T^ Jr'.'At " JT"/\i 

( \l/2 

/-t" jAf oo r „t 

<sup ' IV'ul'd(M)Je;4y |[M];.jn ^2^^^^^ 

;>0 Jz'lAt 77 ^' [Jo 



j>0 JT'lAt I ^.^0 

and noticing that 



1/2 



sup li/'j'cKM),, ^ 0, e-^ y \[M]]f = Op{l) 

/>0 J.'!At 

by Lemmas I A .211 A .41 and Condition l2.31 
Step b) Let us show that 

(19) e-i((M - M")(p) ■ <M)„ ^ ^{bcpy) ■ <M), 

uniformly in z; £ [0, f] in probability. Fix 61,62 > arbitrarily and take a 
bounded cag adapted process cp^ such that 

P[\(p - cp^\^ ■ {M}t > 61] < 62 
by Lemma IA31 Notice that 

(20) \e-H{M-M"){(p-(p'))-mv\< - • {M)t yje-^iM - M")^ ■ {M)t 
and 

e-2(M-M")2.<M)f = Op(l) 
by Lemma |2!9l Note also that 

(21) \{bcp) ■ <M), - (fo/) • <M),| < ^Jh^^Mt ^\<p - cp^P • <M)f. 

Since 61,62 can be arbitrarily small, the estimates (|20l > and (|2T]| ensures 
that we can suppose 9 is a bounded cag adapted process without loss of 
generality. Then, putting 

(p'l = (p,n for i > with s £ [t^", T^^i), 

we have also that 

\e-\{M - M"){<p - cp")) ■ <M)-,| < ^\cp - <p"|2 . (M)f ^e-^M - M«)2 • {Mh. 
Note that 

|<p - cp"\^ ■ {M)t ^ 
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in probability as n — > oo because (p is now assumed to be bounded and left 
continuous. Applying Lemma l2!9l we have ((T9|). 

Step c) Let us study the asymptotic distribution of D" defined as ((T6)) . Put 

D" ^ D" - ^{by) ■ M"^ . 

In light of Theorem lA.il it suffices to show the following convergences in 
probability. 

(1) 
(2) 

{D"h ^ ^(cVk) • {Mh, 

(3) 

{D",M}t ^ 0, 

for all t € [0, T) and for all bounded martingale M orthogonal to M^. The 
last one is trivial. In order to see the first convergence, it suffices to see 

<D",M^)t = e-\{M - M")k) • {Mh ^ ^{byx) ■ {M)t 

in probability. This is shown in the same manner as for ([l9l ). In order to see 
the second convergence, fix 61,62 > arbitrarily and take a bounded cag 
process such that 

P[\k - K'f • <M)f > 61] < 62 
by Lemma IA31 Notice that 

(22) <D") = e-2((M - M")2k^) • (M) + e,;2((M - M"f{K - k^)) ■ <M) 
and the second term is negligible since 

e;2((M - M")2|k - K^l) • <M)t < yje-^M - M")^ • <M)f V|k - k^P • <M)f 

in light of Lemma |Z9] and the fact that 61, 62 can be arbitrarily small. Fur- 
thermore, putting 

= K^„ for > with s e [z^j, t^/^,), 

we can replace with k^'" in the first term of ((22)) by the same argument. 
Then, we have from Lemma |2!9] that 

<D"),^l(flVK)-<M)j 

in probability. Since 

{{by) ■ M^) = {b^y^K) ■ <M), 

it remains only to show 

{D\{by)-M\^^{bYK)-{M)t 
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in probability. Since the left hand side is 

e-\iM-M")byK)-{M}t, 

the convergence follows from the same argument as for ([l9l ). //// 
Proof of Theorem llJ] The convergence @ follows from Condition l2.4l and 
LemmalAll The convergence of U"Z" in D[0, T) is a consequence of the fact 
that the convergence of Z"/en is stable. To show flOl) , we first notice that 

g4 /g2 - -|g3 /G^ |2 > IG^ /G^ |2 a.s., 

],n' j,n ^' j,n' j,n' — ' ],n' j,n' 

which follows from Lemma [B.2I In light of Lemma [A.4I and Condition 12. 4[ 
this inequality implies 

(Hc2) • <M) > (HC"^) • <M) a.s. 
for any H £ !Pj^. Thus we have 

Vt = licY) ■ {y)t\{\u\Q ■ <M)tP 
6 

> liCV) ■ {YhmQ ■ <M)f|2 > 1 |(|m7|2/3k1/3) . <M)i|' 

by Holder's inequality. //// 
Proof of Theorem 12. gl The convergence ((TT]) follows from Condition 12.51 
and Lemma [A.4I The convergence of ^^N[T^Z" in D[0, T) IS a consequence 
of the fact that the convergence of Z" /e„ is stable. To show ((12)) . we first 
notice that 

GlJG% - \GlJGlf > Gl a.s., 

which follows from Lemma [B.ll In light of Lemma [A.4I and Condition |2.5[ 
this inequality implies 

(Hc^) • <M) > (Hq^) ■ <M) a.s. 
for any H £ !P^. Thus we have 

Vt - licY) ■ {Yh q-^-mt 

> -MY) ■ (Yh q-^ ■ <M)f > \{\y\ V^) • {M)t 
6 6 

by the Cauchy-Schwarz inequality. //// 

3. Effective schemes 

Here we give effective discretization schemes. Let {X,Y,M) £ S. For the 
sake of brevity, we suppose T is finite in this section. Then, by a localization 
argument, we can suppose without loss of generality that there exists 6 > 
such that (M) is strictly increasing a.s. on [T - 6, T). In fact, we can consider 
a sequence instead of M defined as, for example, 

Mf = MjAax + - ^tAo^r OK = inf{f > 0; <M)t > X} A (T - 



DISCRETIZATION ERROR OF STOCHASTIC INTEGRALS 



13 



with X — > oo, where W is a Brownian motion defined on an extension of 
(Q, T , P). Recall that stable convergence is stable against such a localization 
procedure. Then, for any positive sequence e„ with e„ — > and for any 
g ^Po with e Po the sequence of stopping times z" defined as 

(23) = 0, t1^, = inf |f > t;;; \Mt - M,n \ = e„g,„ | a T 

satisfies Conditions I2.4l and l2.5l with 

bs = 0, al^ql^ = g^, 

which follows from a famous property on the exit times of one-dimensional 
continuous local martingales. 

Proposition 3.1. Let {X, XM) £ S and u e Pq. The lower hound (lIOl ) is attained 
by t" defined as (l23l ) with g = Iwl^^'^ly-^Kp^^'^ ifS'S~^ ^ ^o- 

Proposition 3.2. Let {X, Y, M) € S. The lower bound (II2D is attained by z" 
defined as (El with g = \y\-^'\-'^/'^ ifg>g~^ ^ 'Po- 

Recall that the lower bound ((12)) was derived from a combined use of 
Lemma [B . 1 1 and the Cauchy-Schwarz inequality. Karandikar [12 J studied a 
scheme which is defined as (|23l) with g — 1 and X instead of M to show the 
almost sure convergence of Z". In case that ip appeared in Condition l2.2l is 
locally bounded and 7 = 1, we can suppose X = M in light of the Girsanov- 
Maruyama theorem. Then, we can conclude that Karandikar 's scheme is 
superior to the usual time-equidistant one in that it yields increments of the 
integrand which attain the equality in Lemma IB. II It is in fact optimal if 

x = y. 

Note that Lemma [B . 1 1 gives a more precise estimate 

c2 • {Yh = {a^ - h^) ■ {Yh > [^b^ + q^) {Y}t. 
The following proposition, for example, is easily shown by this estimate. 

Proposition 3.3. Let {X,Y,M) e S, Z" be defined as ^ and € Pq. Denote 
by 7~(/3, 5) the set of sequences of schemes z" which satisfies Condition 12.51 with 
b = fi and q^ = 5. Then, for all t e [0, T), Z^/e„ converges to a mixed normal 
distribution with the asymptotic conditional mean 

and the asymptotic conditional variance Vt satisfying 

Vt = licY) ■ {Yh > ^ + 5j . ^y^^ ^ g _ 
The equality is attained by t" e 7~i(M) n 7~2(M) defined as 

t" = inf U > t"; Mf - M^" > e^^x" ^/s^ or Mt - Mj" < -Snk'} J^'\ 
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with Tq = 0, where 



2 

4. Conservative delta hedging 

This section treats conservative delta hedging of Mykland fTB\ as an 
example of financial applications. This framework includes the usual delta 
hedging for the Black-Scholes model; even for this classical model, results 
presented in this section give a new insight and a new practical technique 
for hedging derivatives. Let S be an asset price process and assume that 

dSt = StifLtdt + otdWt) 

is satisfied for predictable processes [i and o and a standard Brownian 
motion W. Consider hedging an European contingent claim /(St) for a 
convex function / of polynomial growth. Define a function p as 

p{S, R, Z) = e"^ I / (S exp{K - E/2 + Vfz}) (/)(z)dz 
Jr 

where (p is the standard normal density. Changing variable, it can be shown 
that 

^^^^ dZ ~ 2^ dS^' dR'^dS 

Put 

7]K = inf{f>0;<log(S))f >X} 
for K>0,Vt^ e-'^Vi, St = e-'^St and 

Ef = K-<log(S)),^,,^, yf = p(Sf,r(T-0,i:f), 71, = ^(Sf,r(T-0,Sf), 
where r > is a risk-free rate. Then, Ito's formula and yield 

r-tArjK 

Jo 

for f € [0, T], that is, the portfolio strategy (71°, 71) with 7Z° = ^"'''(1/, - rctSt) 
is self -financing up to 1]k T. Moreover, the convexity of / and l|24)l imply 
that p is increasing in Z,, so that 

Vt > p{St, 0, 0) = /(St) on {t]^ > T}. 

Note that p is the Black-Scholes price with cumulative volatility K and that 
71 is the corresponding delta hedging strategy. The above inequality ensures 
that the delta hedging super-replicates any European contingent claim with 
convex payoff on the set {r]i<: > T}. AsK^ <x),P[riK > T] ^ 1, so that a hedge 
error due to the incompleteness of market converges to 0. Contracts such 
as variance swap serve as insurances against the event rjK <T for predeter- 
mined K which is not so large. See Mykland [19J for an improvement of this 
conservative delta hedging. The purpose here is, however, not to treat such 
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a hedge error due to the incompleteness but to treat a hedge error due to 
the restriction that trades are executed finitely many times in practice. Note 
that the rebalancing of a portfolio is usually executed a few times per day 
while observation of S is almost continuous. Hence, the estimation error of 
(log(S))f appeared in is negligible compared to the discrete hedging error. 
Suppose for brevity that r]x > T a.s.. An approximation of the strategy n is 
n" defined as 

< = 7z,.fors€[T;,T;Vi) 

for a discretization scheme t". In this context, N[T"]t is the number of 
transactions up to time t < T. The discounted replication error is given as 



= e-'-'iVt - Vf) = finu - nl)dSu. 
Jo 



Notice that after a Girsanov-Maruyama transformation, X = 7i is a local 
martingale and (X, Y, X) = (n, S, n) £ S. According to our results in the 
preceding section, the lower bound of the asymptotic variance of VN[T"]tZ" 
is attained by the scheme 



where 



t"o = 0, t1^, = inf |i > T^; \nt - n,.;f = ele'h.n^ A T, 



Tt = -^{St,r{T-tlZt). 



Note that T is what is called gamma in financial practice. In this case, 
t" £Ti(X) with 

bs = 0, al^cl^ql^ e'T^, 

so that we have 



1 

6 Jo' 



1/2 

""'r,d<s) ' 



as the asymptotic distribution of Z" /e„, where Z is an independent standard 
normal variable. This scheme is efficient in that the conditional mean- 
squared replication error is asymptotically minimized for a conditionally 
given number of transactions. The number of transactions N[T"]t is of course 
random; it is high if the path of T is of high level because \nt — n^n^ IT.^>' « 

|St - St'/PFt^k. This property is intuitively expected in practice. Note that e„ 
controls the expected number of transactions. The asymptotic distribution 
of ^N[t%Z1 is 

4z r e-'-''r,d<S)„. 

V6 Jo 

In the equidistant case t" = ;/n, we can apply Theorem |2.6l to e„ = 1/ ^fn, 
M = W, 7 = laS, Y^S with 
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to have that VMr^Z" converges !F-stably to 



2 Jo 



,1/2 

-2rur-2 T. c2. 



The inequality for the asymptotic conditional variance 



follows directly from the Cauchy-Schwarz inequality. 
Karandikar's scheme is defined as 

= 0, t]^^ = inf |f > T^;; |7Zf - uA = e„| a T. 

After the Girsanov-Maruyama transformation, we apply Theorem 12.61 to 
X = M = 71 with 

&s = 0, fl^ ^ = = 1 
to have that { VN[T"]t-Z"} converges ;F-stably to 

The inequality for the asymptotic conditional variance 

l{X^-'"r,,d<s>„}%{X'r;d<s>.}{lj;'.-d<s>„ 

follows again directly from the Cauchy-Schwarz inequality. 

Taking the purpose of hedging into consideration, it might be preferable 
to use such a scheme t" that the asymptotic mean of Z"/e„ is negative. 
Proposition 13.31 presents an effective scheme for a given asymptotic condi- 
tional mean and a given asymptotic conditional number of transactions. 

More importantly, we can incorporate linear transaction costs. Suppose 
that the total cost of the delta hedging with a discretization scheme t" up 
to time f < r is proportional to 



/=0 



Let us study the asymptotic distribution of C"Z". After the Girsanov- 
Maruyama transformation, 7z is a local martingale as before. Notice that 

CO 

CnCl St»Nt» ,Af - 7TT«Atl + On{\) 

;=0 
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if t" € Ti(M) with M = 7z. Apply Theorem O to X = M = ti, Y = S and 
M = S to have that {C"Z"} converges !F-stably to 



{SQ-{S)t^-b-St + Z^-c^-{S)t 
and that the asymptotic conditional variance of {C"Z"} has a lower bound 

3 



1 j\e-^'^SuTlf''d{S)u 
which is attained by t" defined as 



0, t" 



inf { f > tJ; \nt - n,nf = ele^'""'! S,nTl„ ] A T. 



This scheme is efficient in that the conditional mean-squared replication 
error is asymptotically minimized for a conditionally given amount of linear 
transaction costs. 



5. Euler-Maruyama approximation 

Here we propose alternative discretization schemes for the Euler-Maruyama 
approximation as an application. Let us consider the stochastic differential 
equation 

dEt = fi{Et, rit)dt + o{Et, r]f)dWt, 
d;]t = eir]t)dt, 

where W is a one-dimensional standard Brownian motion and a, 6 are 
continuously differentiable functions. Since it is rarely possible to generate 
a path of H fast and exactly, the Euler-Maruyama scheme is widely used 
to approximate to H in simulation. For sequences t" = {t"} with the 
Euler-Maruyama approximation H" of H is given as 

dE';^^{E';,q^)dt + a{E';,if)dWt, 

dif; = eiit)dt, 

where H" = E"„, fj" - if^,, for j > with t e [t", t'? -^). Usually t" = j/n 

is taken. The convergence rate of the approximation has been extensively 
investigated; see e.g., Kloeden and Platen 1, 14 J for a well-known strong 
approximation theorem and Kohatsu-Higa IfTSi , Bally and Talay mill, Kon- 
akov and Mammen |16| for weak approximation theorems. Newton I20II 
treated passage times. Cambanis and Hu |3| studied efficiency of deter- 
ministic nonequidistant scheme. Hofmann, Miiller-Gronbach and Ritter IHl 
treated a class of adaptive schemes. Here we exploit a result of Kurtz and 
Protter [17J, Jacod and Protter [10| to deal with the asymptotic distribution 
of path wise error. Our aim here is to construct discretization schemes which 
are more efficient than the usual equidistant sampling scheme. 
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With the aid of localization, we can suppose ju, a, 1 /o, 6 and their deriva- 
tives to be bounded. Suppose that t" € 7~(W). By Theorem l2.6[ there exists 
a conditionally Gaussian martingale Z such that 

r-stably, where = for / > with t e [t'J, t^^^). Put L'} = e-'^{S'l - H). 
Then, applying Kurtz and Protter |ll7| , we have that L" converges to a 
process L which satisfies 

dLf = diniEt, rit)Ltdt + d^aiEf, Tjt) [LtdWt - o{Et, r]f)dZt] , 

where di refers to the differential operator with respect to the first argument. 
Solving this stochastic differential equation, we obtain 

Lf = -et I e;'^o{Es, ris)dio{Es, rjs) [dZ^ - d^oiEs, 7]s)d<Z, W)^] , 
Jo 

where 

et = exp I di^{Es, ris)ds + J dia{Es, r]s)dWs -\J^ dia{Es, ?]s)^ds| . 

Therefore, in light of Theorem |2.6[ the distribution of Lf is mixed normal 
with conditional mean 

—et es^oi^s, r]s)dio{Es, 7]s)fcs [dWt - dia{Et, r]t)dt] 

and conditional variance 

(25) ^e^ e;M^s, ns?dio{E„ r]sfc^dt. 

Proposition 5.1. For any T > 0, the space-equidistant scheme z"p defined as 
(|23l ) with M - W, e„ = n~^l-^, g - 1 is three times efficient than the usual 
time-equidistant scheme t"j^ = {;/n} in the following sense; for any t € [0, T), 

• £[N"] < nt and N^/n ^ t in probability asn ^ oofor both N" - N[T"p] 
and N" = N[t'^J, 

• the asymptotic conditional mean ofL" is Ofor the both schemes, 

• the asymptotic conditional variance of L" for t"^ is one third of that for 



4m- 



Proof: For the space-equidistant case, 

rN"-i 



bs = 0, a^,=c^,= 1, P[N"] = nP 



7=0 



< nt, 



while 

bs = 0, al = cl = 3, Nj" = [nt] 
for the time-equidistant case. //// 
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Newton pO) studied this space-equidistant sampling scheme; the su- 
periority of this scheme is more-or-less known. The above simple fact of 
asymptotic conditional variance, however, has not been recognized so far. 
The assumption that W is one-dimensional is a serious restriction. Never- 
theless, also for a stochastic volatility model 

dEt = fi{t, Et)dt + d(t, Vt)[p{t, Vt)dWl + ^1 - p{t,VtfdWj] 

dVt = pit, Vt)dt + a{t, Vt)dWl 

with a two-dimensional standard Brownian motion (W^, W^), a scheme 
defined as (|23l ) with M = and ^ = 1 results in a one third conditional 
asymptotic variance of the Euler-Maruyama approximation error for H. 
This is because in light of Theorem 12.61 discretization error is determined 
by only conditional moments of increments of integrand, which is a fimction 
of V independent of in this example. 

Next, let us consider to minimize ((25)) in case that dio is nondegenerate. 
Define t" as 

(26) = 0, tJ^, = mf{t > tJ; | - W,f = e{T^)}, 

where 

a(H^,„i]^„)(9ia(H;'„,7]^„) 

/ i i i 

and 

;-i 

log(e» = y j<9i^(H^,„ j]"^..){tI, - T^) + <9ia(H^,„ r]^„)(W," - W,-,) 

-i5ia(H^,„<„)^(T:Vi-<)). 
Then, Condition 12.51 is satisfied with 

9 7 9 

bs = 0, flg = Cs = flg = —— _ -. 

Froposition |3.2| implies that this adaptive scheme attains a lower bound for 
(|25)l among t" £ 7~(W). In this sense, this scheme is optimal. A disadvantage 
of this scheme is the difficulty to estimate the expected number of data. In 
other words, we cannot answer how to choose £„ so that the expected 
number of data is less than n. In practice, it will be better to use 

= 0, tJ^, = inf{f > T^, m - Wrf = e(Tp V e'J, 

for > in order to ensure that a simulation is done in a finite time. 

We conclude this section by a remark on generating the random variable 
(t, Wj) satisfying 

T = inf{f >0;|Wf- Wol = e} 



e(T") = 
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on a computer for a given e. It is sufficient that the distribution function 
of T is available because 

P[W^ = Wo + e] = 1/2 

and T, are independent. In fact, for a random variable U imiformly 
distributed on (0, 1), 

(T, - Wo) ~ {F-\2U - [2U]le{2[2U] - 1)). 

It is known that the density of t is given by 

(4n + If 6^ ^ 



oo , 

; ^ (4n + l)eexp< — 

n=-oQ ^ 



^JTjt^ ^ \ 2t 

See Karatzas and Shreve Hll, 2.8.11. Using the fact that 

Jo V27Zf3 Ja/^^t 

for a > 0, we obtain Fe{t) = G(e/ ^^t), where 

OO 

G(x) = 42]^(0((4n + 3)x) - 0((4n + l)x)). 



)i=0 



According to our numerical study, G(x) « 1 for < x < 0.1. This is not 
surprising because G(0+) = 1, G'(0+) = G"(0+) = 0. Note that If x > 0.1, the 
speed of convergence of the infinite series is very fast. We can therefore use 



G(x) 



4 L\^Jo^ (^'((4n + 3)x) - 0((4n + l)x)) x > 0.1, 
1 0<x<0.1 



for, say, N = 3 as a valid approximation of G. It is noteworthy that G is inde- 
pendent of e, so that once we obtain the inverse function of G numerically, 
it is done very fast to generate t repeatedly even if e changes adaptively as 
in <|26l(. Note also that 



G(x) < 4(1 - 0(x)), G-\ij) < 0-i(l - y/4). 

These inequalities will be useful in numerical calculation of G~^ for suf- 
ficiently small y ( large x ). Besides, if x > 3, G(x) ^ 4(1 - 0(x)) and 

G-i(y)«o-i(i-y/4). 



Appendix A. Auxiliary results 

Here we give auxiliary results for the proof of our theorems. The follow- 
ing limit theorem, which plays an essential role in this article, is a simplified 
version of a result of Jacod [9J and Jacod and Shiryaev |Tl|. Theorem IX.7.3, 
which extends a result of Rootzen [21 J. Let M = {Mt,'Ft,0 < t < oo} be 
a continuous local martingale defined on (Q, T^, P) and At""" be the set of 
bounded {!Ft}-martingales orthogonal to M. 
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Theorem A.l. Let {Z"} be a sequence of continuous {ftVlocal martingales. Sup- 
pose that there exist an {Tthadapted continuous process V = {Vt} such that for all 
Me M^ie [0,oo), 

(Z",M)f ^ 0, <Z",M)f ^ 0, {Z"}t ^ Vt 

in probability. Then, the C[0, oo)-valued sequence {Z"} converges T -stably to the 
distribution of the time-changed process Wy where W is a standard Brownian 
motion independent ofT- 

The following lemma is repeatedly used in our proofs. 

Lemma A.2. Consider a sequence of filtrations 

'H;c'H;;i, ;,neZ+ = {0,1,2,...} 

and random variables {JJJlygN with U" being -measurable. Let N"(A) be a 
{'HJ^stopping time for each n e Z+ and A which is an element of a set A. Let 
U(X) be a random variable for each A e A. If it holds that there exists Aq e A such 
that 

N"(Ao) 

N"(A) < N"(Ao) a.s. for all A £ A and P[\U1\^\'HJ_^] ^ 

7=1 

asn ^ oo, then the following two are equivalent; 
(1) 

N"(A) 



sup 

AeA 



7=1 



asn —> oo. 



(2) 



sup 

AeA 



N"(A) 

nuimi,] - u{A) 

7=1 

Here the convergences are in probability. 
Proof Note that 



fls n — > oo. 



sup 

AeA 



N"(A) N"(A) 
7=1 7=1 



< sup 




ken 


7=1 



where 



By the Lenglart inequality, we have 











sup 

keN 


En 

7=1 


> £ 





Y^E[\vifmj_,]>n 

7=1 
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for any e, ?] > 0. The result theri follows from the convergence 

oo N"(Ao) 
j=l ;=1 

in probability. //// 
The following lemma is well-known, so its proof is omitted. 

Lemma A.3. For all H e !P^, t e [0, T) and 61,62 > 0, there exists a bounded 
adapted left-continuous process H such that 

P[\H - Hf ■ {M}f > 61] < 62. 

The following lemma is taken from Fukasawa f5| . 

Lemma A.4. Let M be a continuous local martingale with E[{M)j] < 00 and 
suppose that t" e T{M). Then, for all t € [0, T), 

(27) sup |M^« As-M^"Asl^ = Op(en) 

;>0,0<s<t ' ■' 

as well as 

(28) sup KM)t" - {M)A = Op(e„), sup \M^n -MA^ = Op(en), 

0</<N[t«]( ' 0</<N[T«]t ' 

where N[T"]f is defined as In particular, 

(29) N[T"]t 00 a.s. 

asn ^ 00. Moreover, for all locally bounded adapted cag process f, it holds 

N[t"] 

(30) 

in probability, uniformly in t on compact sets of[0, T). 
Proof Note that 

N[t«], 
7=0 

since E[{M)t] < 00. By the assumptions, it follows 

N[t"], 

y E[(M,« -M,.0''rT«] = Op(4) 

7=0 

for k = 3,6 and with the aid of Lemma [A.21 we have 

N[t«], 

(M,-.^-M,«)' = 0p(4)- 

7=0 



N[t"], t 
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N[t"] 



sup \M,: - M,«|2 < i y (M,« - M,uf 



o<y<N[T» 



;=0 



1/3 



SO that the second of (|28)) follows. 

To show (|27)) , we use Doob's maximal inequality to have 



sup \Mjn .t-M-,n^i\^^\T-,n 

0<Koo ' ' 



IGl^ = o,{ei) 



for k -3,6. Using Lemma rA.2l again, we obtain 

N[T"]f 

y sup |M^«^ AS -Mt"asI^ = Op(4), 



y_Q 0<s<oo 



which implies (|27)) since 

sup |Mt" as - Mt"asP ^ i sup |Mt:« as - Mt:.v 



N[t"] 



;>0,0<s<i 



7=0 



1/3 



Using the Burkholder-Davis-Gundy inequality and Doob's maximal in- 
equality, we have also 



N[t«], 



7=0 

for k = 3,6, which implies the first of (|28l) in the same manner. Note that 
follows from 



N[T"]t + 1 > 



(M)f 



\{M),n^^ - {M)A 

To see (|30l ), again in light of Lemma [A.21 it suffices to observe that 



N[t«], t 



and 



N[t"], IV Lt it 



N[t" 

E 

7=0 



//// 
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Appendix B. Kurtosis-Skewness inequality 

Lemma B.l. Let Xbe a random variable with E[X] = and E[X^] < oo. Then it 
holds 

(31) E[Xl_E[X^ 

The equality is attained if and only ifX is a Bernoulli random variable. 

Proof: This is called Pearson's inequality and shown easily as follows. 

E{X^f = E[XiX^ - E[X2])]2 < E[X^]E[\X^ - E[X^]\^] = E[X^]{E[X'^] - E[X^]% 

If the equality holds, then X and X^ - E[X^] must be linearly dependent, so 
that X must be Bernoulli. //// 
The following lemma gives a similar inequality to the above. The proof 
is however rather different and the result itself is seemingly new. 

Lemma B.2. Let Xbe a random variable with E[X] = and E[X^] < oo. Then it 
holds 

m^ E[X^_3E[X3]2 E[X^] 



E[X2]2 4E[X2]3 - E[|X|]2' 
The equality is attained if and only ifX is a Bernoulli random variable. 

Proof: We divide the proof into 4 steps. 

Step a) It is straightforward to see that the equality holds if X is a 
Bernoulli random variable with E[X] = 0. 

Step b) Let us show if E[X] = and the support of X is a finite set, then 
the distribution P^ of X is a finite mixture of Bernoulli distributions with 
mean 0. First, consider the case n = 3. Suppose without loss of generality 
that 

P[X ^ a] ^ p, P[X = b] = q, P[X = c] = r, p + q + r = l 
with a > b > > c. Put 

Piia) = — , Pi(c) = — , Piib) = f^, P2(c) = ^ 



a - c a-c b - c b - c 

Then Pi and P2 define Bernoulli distributions with mean and supports 
{a,c} and {b,c] respectively. Putting A = (c - a)plc - P^{a)/Pi{a), we have 

APi(a) = p, (1 - A)P2{b) = q, APi(c) + (1 - A)P2(c) = r, 

which means 

P^ = APi + (1 - A)P2. 
Now, let us treat the general case by induction. Suppose that the claim holds 
for the case of n and consider the case of n + 1. Without loss of generality, 
we suppose 

P[X = aj] = pj, ; = 0,l,...,n, po + pi + ■ ■ ■ + p„ = 1 

with 

ao > ai > ■ ■ ■ > aic > > a^+i >•••>«„ 
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for an integer k,2 < k < n - 1. Put 



fli = 



Po + Pi 



, Pi = Po + Pi, fl; = a.], Pj = Pj, 2< j <n 



and 



Haj) = Pj, i = l,...n. 
Notice that P defines a distribution with mean and support {ai,a2, . . . ,a„}. 
By the assumption of induction, there exists A,y >0, l<i<k, k<j<n 
such that 

Here P,y defines a Bernoulli distribution with mean and support {ai,aj}. 
Now consider a distribution defined as 



Po+Pi ' ' Po+Pi 

for k < i < n. Notice that Qj is a distribution with mean and support 
{flO/fli/«;}- As seen above for the case n -3, putting ^ij - Qj{ao)/Poj{^o), we 
have 

Qy = fJyPO; + (l-fi;)Pl;, 

where we define 



PiMi) 



Ui — a J 



Pijiuj) = '—, <i <k, k < i <n. 



Ui — Uj 



Putting 

Aoj = [ijAij, Aij = (1 - l^j)Aij, Aij = Aij, 2<i <k, k < i <n, 
we have 

hi ''j 
which completes the induction. 

Step c) Let us show that the function f{u, v, w, y) defined as 



(33) 



f{u,v,w,y) - u - '^v^/w - iv^/y^ 



is a concave function. Note that the inequality (|32)l follows from Steps a, 
b and c since every distribution can be approximated arbitrarily close by a 
distribution supported by a finite set. By a straightforward calculation, the 
Hessian matrix of / is given by 



n —3- M. n 
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Again by a straightforward calculation, the determinant of H - xl is of 
form x^{x + a){x + |3) with a > 0, |S > 0, which means that H is negative 
semi-definite. 

Step d) It remains to show that the equality holds only if X is a Bernoulli 
random variable. Suppose that there exists a random variable X with 
E[X] = such that the equality holds in (|32]l which is not Bernoulli. Recall 
that the equality holds in ((3T|l only if X is Bernoulli. It implies that the vector 
of the first four moments of X does not coincide with that of a Bernoulli 
random variable. Note that there exists a random variable X of which the 
support is a finite set, such that 

E[X] = E[X], E[\X\] = E[|X|], £[X2] = E[X^l E[X3] = E[X^l E[X^] = E[X% 

This can be proved by the Hahn-Banach theorem. Hence, we assume the 
support of X is a finite set without loss of generality. Then, by Steps b and c, 
there exist Bernoulli distributions Pi and P2 and A e (0, 1) such that Pi ^ P2 
and 

(35) /(Ami + (1 - A)m2) = = A/(mi) + (1 - A) f (7712), 

where / is defined as (l33l > and 

m, = |j" a'^Piida), J a^Pi{da), J a^Pi{da), J \a\Pi{da)^ , i = 1,2. 

Here / means the transpose of matrix. By ([34l) , the eigenvectors of the 
Hessian matrix H associated to the eigenvalue are 

/7i = (1,0,0,0)', h2^iO,v,zv,yY. 

Therefore, ([35l > implies that there exists a constant c such that m2 = cfhi, 
where 

m, = |j^ a^Piida), J a^Pi(da), J \a\Pi{da)^ , i = l,2. 

It suffices then to show that fh2 = crhi implies c = 1 and that rhi uniquely 
determines a Bernoulli distribution. Set 

P2(fl) = p, P2{-b) = q, p + q ^ 1, ap ^bq, fhi ^ {v,w,y)' 

and 

(t'p - b^q = cv, a^p + h^q = civ, ap + bq = cy. 
Then we obtain that 

2wq 2wp 
a = , b = , 

y y 

so that 

— =fl^-fc^ = -^{l-2p). 

y y^ 

Therefore, a, b, p, q are uniquely determined independently of c, which com- 
pletes the proof. //// 
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